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Abstract 

Polygalacturonases (PGs) are hydrolytic enzymes employed by several phytopathogens to weaken the plant 
cell wall by degrading homopolygalacturonan, a major constituent of pectin. Plants fight back by employing 
polygalacturonase-inhibitor proteins (PGIPs). The present study compared the inhibition potential of pearl millet 
PGIP (Pennisetum glaucum; Pg/PGIP1) with the known inhibition of Phaseolus vulgaris PGIP (PvPGIP2) against 
two PGs, the PG-II isoform from Aspergillus niger (iAnPGII) and the PG-III isoform from Fusarium moniliforme 
(FmPGIII). The key rationale was to elucidate the relationship between the extent of sequence similarity of the 
PGIPs and the corresponding PG inhibition potential. First, a pearl millet pg/p gene {Pglpgipl) was isolated and 
phylogenetically placed among monocot PGIPs alongside foxtail millet (Setaria italica). Upstream sequence anal- 
ysis of Pglpgipl identified important c/s-elements responsive to light, plant stress hormones, and anoxic stress. 
Pg/PGIP1, heterologously produced in Escherichia coli, partially inhibited AnPGW non-competitively with a pH 
optimum between 4.0 and 4.5, and showed no inhibition against FmPGIII. Docking analysis showed that the 
concave surface of Pg/PGIP1 interacted strongly with the N-terminal region of iflnPGII away from the active site, 
whereas it weakly interacted with the C-terminus of FmPGIII. Interestingly, Pg/PGIP1 and PvPGIP2 employed 
similar motif regions with few identical amino acids for interaction with 4nPGII at non-substrate-binding sites; 
however, they engaged different regions of 4nPGII. Computational mutagenesis predicted D126 (Pg/PGIP1)-K39 
(AnPGW) to be the most significant binding contact in the Pg/PGIP1-4/7PGII complex. Such protein-protein inter- 
action studies are crucial in the future generation of designer host proteins for improved resistance against ever- 
evolving pathogen virulence factors. 

Keywords: Computational mutagenesis, electrostatic surface potential, inhibition studies, pearl millet, Phaseolus vulgaris, 
PGIPs, PGs, protein modelling and docking. 
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Introduction 

Pectin, a galacturonic acid-rich complex polysaccha- 
ride found in all plant cell walls (CWs), is composed of 
homogalacturonan (HG), xylogalacturonan, apiogalactu- 
ronan, rhamnogalacturonan I, and rhamnogalacturonan II 
(Vorwerk et al., 2004). HG, a linear homopolymer of a-1,4- 
linked D-galactopyranosyluronic acid with varying extents 
of methylation and acetylation, is the most abundant 
component (Mohnen, 2008). Phytopathogens are known 
to produce endo- and exo-polygalacturonases (PGs) that 
can breakdown HG (Van den Brink and De Vries, 2011). 
As a counter stratagem, plants employ polygalacturonase- 
inhibitor proteins (PGIPs) to modulate PG activity leading 
to an accumulation of elicitor-active ohgogalacturonides 
(De Lorenzo et al., 2001). PGIPs are CW-bound glycopro- 
teins belonging to the extracyoplasmic leucine-rich repeat 
(LRR) family of proteins (Shanmugam, 2005). PGIPs are 
very diverse in their PG-inhibition specificity and poten- 
tial with varying degrees of inhibition (Gomathi and 
Gnanamanickam, 2004). 

PG-PGIP complexes are considered a model protein- 
protein interaction system in the backdrop of plant-patho- 
gen interactions (Misas-Villamil and Van der Hoorn, 2008). 
Although three-dimensional structures of many PGs have 
been elucidated to date (Pickersgill et al, 1998; Federici 
et al, 1999; Bonivento et al, 2008), the only PGIP whose 
crystal structure has been solved is that of PvPGWl from 
Phaseolus vulgaris (Di Matteo et al, 2003). Most of the data 
available on the PG-PGIP interactions has been a result of 
studies involving /'vPGIP2. Previous studies employed tar- 
geted mutation of pg and pgip genes, and investigated the in 
vitro inhibition behaviour of the protein variants synthesized 
to identify the amino acid residues involved in the protein- 
protein interactions (Leckie et al, 1999; Mattei et al, 2001; 
Raiola et al, 2008). Amide-exchange mass spectrometry in 
combination with protease protection and fluorescence spec- 
trometric analysis was employed to deduce the amino acids 
of ^«PGII, a PG isoform II from Aspergillus niger, required 
for interaction with /'vPGIP2 (King et al, 2002). The avail- 
ability of advanced bioinformatic tools for protein homol- 
ogy modelling and docking have been exploited in in-depth 
analysis of PG-PGIP complexes in silico and found to be in 
conformity with the experimental results (Lim et al, 2009; 
Maulik et al, 2009). 

In contrast to the magnitude of literature available on dicot 
PGIPs, information available in case of monocots is mea- 
gre. Although PGIPs from wheat and rice have been tested 
for inhibition against various PGs (Jang et al, 2003; Kemp 
et al., 2003; Janni et al, 2006, 2013), no efforts have gone into 
understanding their mode of inhibition and the underlying 
structural basis of their interaction with PGs. In addition, no 
attempt has been made towards characterization of PGIPs 
from millets, small-grained gramineous monocots. Pearl mil- 
let [Pennisetum glaucum (L.) R. Br.; synonym: Cenchrus amer- 
icanus (L.) Morrone], is among the most important cereal 
crops grown in the semi-arid tropical regions of Africa and 
the Indian subcontinent (Sehgal et al, 2012). 



In the present study, the gene encoding pearl millet PGIP 
was isolated and expressed heterologously as a maltose-bind- 
ing protein (MBP) fusion in Escherichia coli. The purified 
recombinant fusion protein was employed in in vitro inhibi- 
tion studies against two fungal PGs, y4«PGII and FmPGIII 
(PG isoform III from Fusarium moniliforme isolate PD). 
PvPGIP2, the most wide-spectrum and potent inhibitor of 
fungal PGs (Farina et al., 2009), has been shown to inhibit 
^«PGII (Stotz et al, 2000) and is reported to be ineffective 
against FmVGlll (Sella et al, 2004). A study of the inhibi- 
tion profile of pearl millet PGIP against the same two PGs 
was carried out to elucidate the relationship between the 
extent of sequence similarity and the corresponding ability 
to inhibit PG. Furthermore, in the present study, in silico pro- 
tein modelling, docking, and mutation analyses were carried 
out to explain the in vitro results, gain an understanding of 
the underlying structural basis of interaction, and predict 
the putative amino acids involved. To the best of our knowl- 
edge, this is the first report on the production of recombinant 
PGIP from millets and exploration of its inhibitory potential. 

Materials and methods 

Plant material 

Seeds of pearl millet (P. glaucum) cultivar IP 18296, obtained from 
the International Crops Research Institute for the Semi-Arid 
Tropics, Hyderabad, India, were used in this study. The seeds were 
germinated on moist germination sheets; 2-d-old seedlings were har- 
vested and stored at -80 °C till further use. 



Isolation of nucleic acids 

Total RNA and genomic DNA were extracted from 2-d-old pearl 
millet coleoptiles using a Total Plant RNA Isolation kit (Sigma) and 
a GeneJET'''" Plant Genomic DNA Purification Mini kit (Thermo 
Scientific), respectively, as per the manufacturer's instructions. 



Isolation and cloning of partial pgip genes 

The cDNA was prepared by means of a two-step AMY 
RT-PCR kit (Qiagen) as per the manufacturer's instructions, 
using total RNA from pearl millet coleoptiles as template. For 
PGR amplification of the partial pgip genes, primers Pari For 
(5'-CTCGACCTCTCCTTCAACTC-3')/ParlRev (5'-ATGCCGCC 
GTAGATGGCGTT GTG-30 and Par2For (5'-TGCGACTGGTA 
CGACGTCGACTG-3')/Par2Rev(5'-TCGCCACCTGCGCCGGG 
ATG-3') were designed based on the consensus region obtained 
by alignment of known monocot pgip gene sequences (GenBank 
accession nos AM 180652-AM 180657, NP_001147231, and 
XP_002439099) (Clone Manager Professional 9, Sci-Ed soft- 
ware). The amplification using 727^ DNA polymerase (Merck 
Biosciences) resulted in ~400bp (Pglpgiplp) and ~500bp (Pglpgip2p) 
bands, respectively. They were gel purified using a QIAquick Gel 
Extraction kit (Qiagen), cloned in pTZ57R/T (InsTA clone PCR 
Cloning kit; Fermentas), and sequenced at Eurofins MWG Operon, 
Germany. Nucleotide BLAST (http://blast.ncbi.nlm.nih.gov/Blast. 
cgi) searches were performed using default parameters to ascertain 
the gene identity. 

Southern blot analysis 

Pearl millet genomic DNA (10 jig) was digested with restriction 
endonucleases separately: Apol (New England Biolabs, Germany), 
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Ahdl, BamHl, EcoRl, Hindlll, Kpnl, Mscl, Nael, Sad, Xhal, and 
Xliol (Thermo Scientific) and separated electrophoretically on a 
0.7% agarose gel as described by Sambrook and Russell (2001). 
The DNA was transferred to a positively charged nylon membrane 
(Roche) as per the manufacturer's instructions. As a hybridization 
probe, PCR-amplified Pglpgiplp was random-prime labelled using 
a DIG High Prime DNA Labeling and Detection Starter kit II 
(Roche). Pre-hybridization, hybridization, and chemiluminescent 
detection were performed as described previously (Wagenknecht 
and Meinhardt, 201 1). Hybridization was carried out at 63 °C. 

Inverse PCR 

Pearl millet genomic DNA (3 ]ig) was digested using Apol (New 
England Biolabs) and the digest mix was cleaned up using a 
NucleoSpin* Gel and PCR Clean-up kit (Macherey-Nagel). 
A 500 ng aliquot of the cleaned-up digest was self-ligated using a 
Rapid DNA Ligation kit (Thermo Scientific). 

The following two sets of inverse PCR primers, InvlA (5'- AC 
GCCTTCAGCTTCAACCTCTC-3')/InvlB (5'-TTGTGCGACA 
GCACTAGGGATG-3') and Inv2A (5'-AGAGGTAGATCTGGT 
CGGCG-3')/Inv2B (5'-CGCCAATTTCGCGCACC-30 were 
designed based on the Pglpgiplp and Pglpgiplp sequences, respec- 
tively. The PCR was carried out using Pliusion® Hot Start High- 
Fidelity DNA polymerase (Finnzymes) with 30 ng of self-ligated 
DNA as template. To the blunt-end PCR product of ~2kb obtained 
with InvlA/InvlB, an A' overhang was attached using Taq DNA 
polymerase (Thermo Scientific). It was further cloned in the pGEM- 
T Easy vector (Proniega) and sequenced at Eurofins MWG Operon, 
Germany. 

Sequence assembly 

Sequence assembly (Clone Manager Professional 9, Sci-Ed soft- 
ware) was carried out using sequences obtained by inverse PCR and 
the Pglpgiplp sequence to determine the open reading frame (ORF) 
(Pglpgipl). Nucleotide BLAST searches were performed to ascer- 
tain the gene identity. 

Bioinformatic analysis of Pglpgipl 

The deduced amino acid sequence of Pglpgipl (PglPGlPl) was 
further subjected to various in silico analyses using different bioin- 
formatics tools: the signal peptide was identified using SignalP 4.1 
(Petersen et ai, 2011) and the targeted protein localization was pre- 
dicted using WoLF PSORT (Nakai and Horton, 1999). The assign- 
ment of domains in Pg/PGIPl was performed based on the NCBI 
conserved domain search analysis (Marchler-Bauer et a/. , 201 1 ) and 
on a comparative amino acid sequence alignment with PvPGIP2 
domain architecture (Di Matteo et al. , 2003) using the T-Coffee tool 
(Notredame et al, 2000). The putative N-glycosylation sites were 
determined using NetNGly 1.0 (http://www.cbs.dtu.dk/services/ 
NetNGlyc/). 

The Pg/PGIPl and other known monocot and dicot PGIP 
sequences (protein accession numbers are summarized in 
Supplementary Table SI at JXB online) from the National Center 
for Biotechnology Information database were aligned with 
MUSCLE version 3.7 (Edgar et al, 2004) and gaps and/or poorly 
aligned regions were removed with Gblocks version 0.91b (Talavera 
and Castresana, 2007). A phylogenetic tree was generated using the 
maximum-likelihood method employed in PhyML version 3.0 aLRT 
(Anisimova and Gascuel, 2006) on the http://www.phylogeny.fr plat- 
form (Dereeper et al, 2008) using default settings. Internal branch 
consistency was appraised using the bootstrapping method with 100 
bootstrap replicates. The tree was rendered using TreeDyn version 
198.3 (Chevenet et a/., 2006). 

The nucleotide sequence upstream of the ORF was submitted to 
PlantCARE (Lescot et al , 2002) to determine the presence of plant- 
specific cw-elements. 



Construction of the Pg\PGIP1 and fmPGIII expression plasnnlds 
The multistep cloning strategy involved in construction of the 
PglPGlM expression plasmid (pET-22b::MBP-IEGR-Pg/PGIPI- 
6xHis-Strep-tag* 11, where lEGR is the factor Xa protease cleav- 
age site and 6xHis is a hexa-histidine tag) (construct E), vector 
control (pET-22b::MBP-IEGR-6xHis-Strep-tag® II) (construct F) 
and the i^mPGIII expression (pET-22b::/'mPGIII-Strep-tag® II) 
(construct G) plasmids have been detailed in Supplementary Table 
S2 at JXB online (Supplementary Fig. SI at JXB online provides 
a pictorial representation of the constructs). Briefly, the Pglpgpipl 
coding sequence was initially cloned in pET-22b(+) in frame with 
the vector-encoded 6xHis sequence. Then, malE (MBP-encoding 
gene), followed by a factor Xa protease recognition site was cloned 
upstream of pgip, and a Strep-tag® Il-encoding sequence was cloned 
downstream of 6xHis, allowing the synthesis of the fusion protein 
MBP-IEGR-Pg/PGIPl-6xHis-Strep-tag* II. The vector control 
was generated by eliminating the pgip sequence from construct 
E. FmPGIII was PCR amplified from the pGEMT-/>MPG/// con- 
struct and subcloned upstream of the Strep-tag® II in pET22b(+)- 
Strep-tag* II, allowing the synthesis of the P'«7PGIII-StrepII fusion 
protein. DNA manipulations such as agarose gel electrophoresis 
and bacterial transformation were carried out using standard pro- 
tocols (Sambrook and Russell, 2001). Restriction digestion (New 
England Biolabs), plasmid isolation (InnuPREP Plasmid Mini/Midi 
kit; Analytik Jena Biosciences), ligation (Rapid DNA Ligation kit; 
Thermo Scientific), and gel extraction (NucleoSpin* Gel and PCR 
Clean-up kit; Macherey-Nagel) were carried out using kits accord- 
ing to the manufacturer's instructions. 



Production and purification of Pg\PGIPl and 'FmPGIII fusion 
proteins 

Competent E. coli SHuffle® T7 Express [pLysSRARE2] was trans- 
formed with the above-described constructs. Expression was carried 
out in 2 1 batch cultures incubated for 24 h at 26 °C using auto- 
induction solutions 'M' and '5052' as described by Studier (2005). 
The total protein was extracted by a freeze-thaw cycle inducing 
the lysozyme-mediated autolysis of the cells. Additionally, sonica- 
tion on ice at 40% amplitude, three times for 1 min each (10 s on/ 10 
s off cycles) on a Branson Digital Sonifier 250-D (G. Heinemann 
Ultraschall- und Laborteclinik, Germany) was performed. The solu- 
ble protein was obtained as supernatant by centrifugation of the cell 
lysate at 40,000 g for 30 min at 4 °C. 

All purification steps were carried out at 10 °C on a FPLC system 
(AKTAExplorer; GE Healthcare, Freiburg, Germany). A flow rate 
of 1 ml min"' was maintained throughout. The protein peaks were 
pooled appropriately after each purification step and concentrated 
using centrifugal concentrators (Vivaspin'''"20; Sartorius). The 
intermediate buffer exchanges and desalting steps were carried out 
using 5 ml HiTrap columns (GE Healthcare). 

The fusion protein MBP-IEGR-P^/PGIPl-6xHis-Strep-tag® II 
(rP^/PGIPl) was purified in two steps. In the first step. Strep-tag® 
Il-based affinity purification was performed on a 1 ml Strep-Tactin 
Superflow Plus Cartridge (Qiagen) using 50 mM NaH2P04, 300 mM 
NaCl (pH 8.0) as loading/wash buffer and loading buffer containing 
2.5 mM D-desthiobiotin (pH 8.0; IBA Lifesciences) as elution buffer 
according to the manufacturer's instructions. The desalted eluates 
were reconstituted in cation-exchange column loading/wash buffer 
(buffer A: 50 mM NaH2P04, pH 8.0) and subjected to purification 
on a 1 ml RESOURCE Q Cartridge (GE Healthcare). The matrix- 
bound proteins were eluted (buffer B: 50mM NaH2P04, 1 M NaCl, 
pH 8.0) by applying a stepwise NaCl gradient [0-22% (buffer A to B) 
in 15 min, held for 2 min; 22-24% in 5 min, held for 2 min; 24-28% in 
5min, held for 2min; 28-100% in 20 min] to resolve the desired pro- 
tein from contaminating proteins. The rPgfPGlPl eluted at 22-24% 
NaCl held for 5 min. 

Recombinant P/7;PGIII-Strep-tag® II (rPwiPGIII) and the vec- 
tor control MBP-IEGR-6xHis-Strep-tag® II (rVC) were purified 
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by single-step affinity purification on a Strep-Tactin Superflow Plus 
Cartridge as described above. 

Protein production and purification were monitored by immuno- 
blot analysis of extracted proteins using Strep-Tactin®-horseradish 
peroxidase conjugate as probe (IBA Lifesciences). The chemilumi- 
nescence detection of blots was carried out as described previously 
(Wagenknecht and Meinhardt, 2011). The protein concentration 
of different samples was determined using a BCA protein assay kit 
(Pierce) with BSA as the standard. 

PGIP activity assays 

^nPGII (5ng; kind gift from Mr Madhusudhan, University of 
Mysore, Mysore, India) and rFinPGlll (36 ng) were incubated sepa- 
rately in a reaction volume of 200 pi with 0. 1 mg ml ' of polygalac- 
turonic acid substrate (Sigma) at 30 °C in 50 mM sodium acetate 
buffer (pH 4.2 and 4.6, respectively). PG activity was determined 
by reducing end-group analysis according to Antlion and Barrett 
(2002). PGIP activity was assayed by measuring the activity of 
the PGs pre-incubated with rPglPGlPl for 20min at 30 °C. PGIP 
activity was expressed as the percentage reduction in the number of 
reducing ends (in pkat mg"' of protein) liberated by PGs in the pres- 
ence and absence of PGIP. rVC served as the control. 

The effect of various parameters such as inhibitor concentration 
(0.316-12.64nM rPg/PGIPl), substrate concentrations (0.025- 
0.25 mg ml"') and pH (3.5, 4.0, 4.5, and 5.0) on enzyme inhibition 
was determined. The kinetic parameters were computed by fitting 
the Michaelis-Menten equation on initial rate experimental data by 
non-linear fitting using OriginPro? (Originlab). In separate experi- 
ments, the temperature and pH stability of rPg/PGIPl were studied 
by pre-incubating them separately for 1 h at temperatures ranging 
from 20 to 100 °C, and for 16h at pH values of 2.0-11.0 at 4 °C, 
respectively, after which they were reconstituted in the appropriate 
assay buffer and their inhibition potential was assayed at 30 °C. All 
experiments were performed twice each in triplicate. The data of a 
representative experiment was subjected to Tukey's honestly signifi- 
cant difference (HSD) test following analysis of variance at P<0.05. 

l-iomology modeiiing of Pg\PGIPl and FmPGIII 

The initial homology models of PglPGlPl and FmPGlll were gen- 
erated using the MODELLER 9.12 package (Eswar er al, 2006), 
with the X-ray crystallographic structures of Pv'PGIP2 [Protein 
Data Bank identity (PDB ID): lOGQ] and endo-polygalacturonase 
I (FniVGl) from F. moniliforme isolate FC-10 (PDB ID: 1HG8) 
serving as templates, respectively. The quality of the generated 
models was assessed using the Verify 3D server (Eisenberg et al, 
1997). Post-refinement of structural models was carried out using 
the KoBa'^""' server (Rodrigues et al, 2012), and further geometric 
accuracy of the constructed models was evaluated using MolProbity 
4.02b (Davis et al, 2007). The obtained models were further energy 
minimized using GROMACS 3.0 (Lindahl and Hess, 2001) with a 
GROMOS96-53a6 force field to remove geometric inaccuracies and 
steric clashes with 1000 steepest descent steps terminated at the con- 
vergence of maximum force <1000 kj niol"'. 



Electrostatic charge distribution analysis 

The adaptive Poisson-Boltzmann Solver program (Unni et al, 
201 1) was employed to calculate the electrostatic distribution of the 
following individual proteins and complexes: PglPGlPl, ^«PGII, 
FmPGlll, Pg/PGIPl-yl«PGII and P^/PGIPl-i^mPGIII. PyMOL 
version 1 .2r3pre (PyMOL Molecular Graphics System; Schrodinger, 
LLC) was used for the visualization of the surface representation. 

Docicing and energy minimization 

To obtain accurate conformations of Pg/PGIP1-^«PGII and 
PglPGlPl FmPGlll protein complexes, the docking simulation 



was carried out in two steps; in the first step, the protein pairs were 
subjected to GRAMM-X (Tovchigrechko and Vakser, 2006) for the 
initial global search of conformation of the two proteins in com- 
plex form and subsequently, such output was resubmitted to the 
Rosetta 3.4 server (Lyskov and Gray, 2008) for further optimization 
of docking results based on rigid body orientation and side-chain 
conformation. The three-dimensional structure of AnPGll (PDB 
ID: ICZF) used in the present docking study was retrieved from the 
PDB, Research Collaboratory for Structural Bioinformatics. 

For energy minimization, molecular dynamics simulations at 
room temperature were carried out using GROMACS 3.0 (Lindahl 
and Hess, 2001) with a GROMOS96-53a6 force field for bound 
PglPGIPl-AnPGIl and P^/PGIPl-FwiPGIII complexes. Protein in 
complexed form was immersed in a cubic waterbox of space water 
model with box edges 1 A from the complexed protein periphery. 
The overall electric charge of the system was compensated. The sys- 
tem was pre-equilibrated, which involved minimization using the 
steepest descent method. A molecular dynamics simulation was per- 
formed at constant volume and temperature for 1 ns. Snapshots were 
collected every 2 ps. 

Protein-protein interaction map 

Hydrogen bond, ionic, and hydrophobic interactions across pro- 
tein-protein interfaces were determined by the Protein Interaction 
Calculator (PIC) server (Tina et al, 2007). Interactions were visual- 
ized and analysed using molecular modelling programs PyMOL 1.6 
and CHIMERA 1.8 (Pettersen et al, 2004). 

Computational mutagenesis 

Computational alanine-scanning mutagenesis was carried out using 
the Robetta alanine-scanning server (Kim et al, 2004) in order to 
identify the energetically favourable amino acids at the interface that 
are important in the stability of the P^/PGIP1-^«PGII complex. 
This program scans the protein-protein interface for the 'hotspots' 
and evaluates the changes in the binding free energy of the complex 
by replacing each of the interface residue with alanine at the binding 
region. 

The relative free energy of binding (AAGbinding) was calculated as 
follows: 

^ ^ ^binding ^bmding_wild type ^bmding_mutaQt 

Positive values of AAGbinding higher than 1 kcal mol"' indicate that 
replacement by alanine is predicted to destabilize the complex. 



Results and discussion 

Isolation of pearl millet pgip genes 

Initially, two partial pgip sequences: Pglpgiplp (407 bp, 
GenBank accession no. GU474543.1) and Pglpgiplp (497 bp, 
GenBank accession no. JQ425039), which shared a nucleo- 
tide sequence identity of 84%, were isolated from pearl mil- 
let using primers based on the consensus sequence of known 
monocot pgips. PGIPs from the same plant, irrespective of 
whether they are of monocot or dicot origin, share significant 
identity (Favaron et al, 2000; Ferrari et al, 2003; Janni et al, 
2006). 

An inverse PGR approach was employed for the isolation 
of the complete pgip coding sequence. For this, a Southern 
blot analysis was performed to identify appropriate restriction 
endonucleases for template preparation. About 10 enzymes 
were tested; however, they did not or only partially cut the 
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genomic pearl millet DNA. Only Apol yielded a continu- 
ous digestion pattern and distinct hybridizing fragments of 
appropriate size (Supplementary Fig. S2 at JXB online) and 
thus was selected to prepare the template. Of the two primer 
pairs used in the inverse PGR, only InvPGIPlA/InvPGIPlB 
amplified an intense fragment of ~2.0kb, which was further 
cloned and sequenced. The sequence assembly resulted in a 
2295 bp contig (GenBank accession no. JF421287) with an 
ORE of 1014bp (Pglpgipl) (Supplementary Fig. S3 at JXB 
online). The pearl millet coding sequence was found to be 
uninterrupted, i.e. without any introns, upon comparison of 
the DNA and cDNA sequences. Most known PGIP-encoding 
genes have been shown to lack introns; however, exceptions 
do exist but only in case of dicots such as Arabidopsis, peach, 
raspberry and Antirrhinum majus (De Lorenzo et al, 2001). 

The Southern blot analysis of pearl millet genomic DNA 
and the isolation of two PGIP-encoding partial sequences 
indicated the possible occurrence of pgip in pearl millet as a 
small multigene family. However, further genome organiza- 
tion studies are necessary to confirm this. It has been reported 
that most plant pgip genes exist as small multigene families 
clustered within a specific chromosomal region (De Lorenzo 
et al, 2001). Brassica napiis, in contrast, is the only plant 
reported so far to contain a large gene family with at least 16 
pgips (Hegedus et al, 2008). 

Bioinformatic analysis of Pglpgipl 

The bioinformatic characterization of Pglpgipl was carried 
out to understand the domain architecture of the deduced 
protein, its phylogenetic position among known PGIPs, and 
nature of c/.s-regulatory motifs upstream of the ORE. Only 
130 (37%) amino acids were identical between Pg/PGIPland 
PvFGlPl; however, they were found to have a global alignment 



score of 95 (Supplementary Fig. S4 at JXB online). A putative 
signal peptide of 27 aa was identified (Fig. 1) for possible apo- 
plastic localization (82% affirmative), which is crucial in plant 
defence against the invading microbial PGs encountered at the 
CW (Di Matteo et a/., 2003). A central domain was detected in 
PglPGlPl, which contained 10 imperfect LRRs of an ~24 aa 
residue consensus sequence characteristic for PGIPs, xxLxLxx. 
NxLx..GxIPxxLxxL.xxL (Di Matteo et al, 2003). The pearl 
millet PGIP also contained two putative (3-sheets, Bl and B2, 
and also the 3io-helix similar to PvFGlFl. The central LRR 
domain has been shown in PvPGIP2 to fold into a right-handed 
superhelix and is connected to the Bl-sheet through loops or 
(3-tums. The B2-sheet is a unique feature not found in other 
LRR proteins (Di Matteo et al, 2003). In PglPGlPl, the cen- 
tral domain was flanked by short N- and C- terminal regions 
with conserved C residues as reported for PvPGIP2, known 
to be involved in disulfide linkages necessary for the structural 
integrity of the protein (Di Matteo et al, 2003). Seven puta- 
tive A'^-glycosylation sites were predicted by NetNGlyl.O with 
a consensus sequence of N-x-S/T (where x is any amino acid 
except P), for A^-glycosylation. Of the seven sites, positions 85 
and 297 were conserved in all four rice PGIPs ((9.sPGIPl-4) 
and both wheat PGIPs reported previously (Janni et al, 2006). 
N297 was the lone residue to be conserved even in PvFGlFl, 
whose glycosylation positions have been experimentally 
mapped (Mattel et al, 2001). N288 was conserved in all rice 
and wheat PGIPs except in O^PGIPl. N258 was conserved in 
Os¥GlP2 and OsFGlF3, whereas N177 was conserved only in 
the latter. 

Phylogenetic analysis of the deduced protein, PglPGlPl, 
and other available monocot and dicot PGIP sequences using 
the maximum-likelihood approach clearly showed monocot 
and dicot PGIPs forming separate clusters (Fig. 2). PglPGlPl, 
was placed among the monocot PGIPs but the millet PGIPs 



signal peptide 1 ksbckkrakrtkrailvvllvaaspaaa 
N-terminus 28 spirHhhddiaalaaigaafggycsawt 

QRD PE^GGG I H@D P FT GRV 



Central LRR-domain 
LRR-1 
LRR-2 
LRR- 3 
LRR- 4 



7 6 TDLAVFQ . DaISi T . GT I PDAV; 
101 RTLBOiHHLPAlS. .GPIPPAI 
126 TMLIISW.TGVS. -GPVPSFLG. 
150 TFLDLSF.NSLT. .GVIPAS 
LRR-5 174 NGiSlsR.NRLT. .GVIP 

* -GSI 



LRR- 6 199 ASLVLSH.NILN. 

LRR-7 222 SQIDLSR . NAFT . . GDi? 

LRR-8 246 QILDLSR . NAFS . . FN| 

LRR-9 269 TWLDLSH . NAIY . .GGII 

LRR-10 293 TQLFfivSYNQLg. .GAVPTG.GI 




Consensus sequence 



Bl-sheet 
XXLxLXX . NxLX. .GxIPI 



IL.XXL 



C-terminus 313 KGKFDAYSFQHNkSlSgaplanlSk 



Fig. 1 . Sequence organization of derived amino acid sequences of Pg/PGiPl . The displayed sequence organization is a resuit of aiignment of Pg/PGIP1 
with P\/PGIP2 whose secondary structure has been determined (Di Matteo ef a/., 2003). The LRR consensus sequence 'xxl_xl_xx.Nxl_x..GxiPxxLxxL.xxL' 
is shown. Putative residues contributing to form the secondary structure eiements are based on P\/PGIP2 and indicated in iight grey (sheets Bl and B2) 
and dark grey (Sm-heiix). Putative glycosylation sites are indicated by an asterisl<;. The conserved 0 residues are highiighted in biack. Numbering of amino 
acid residues is shown on the ieft. 
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Fig. 2. Phylogenetic tree showing the affiliation of Pg/PGiP1 among other l<nown monocot and dicot PGIPs. The deduced amino acid sequences of 
Pg/PGIP1 and other l<;nown PGiP sequences obtained from GenBanl< were aligned using MUSCLE version 3.7, curated using the Gbiocl<s version 
0.91 b, and then submitted to PhyML version 3.0 aLRT for phyiogenetic analysis, and the tree was rendered using TreeDyn version 1 98.3. The position of 
Pg/PGIPI is highlighted by a box. The branch support values are represented at branch points and the branch length scale is shown below the tree. The 
protein accession numbers are summarized in Supplementary Table SI . 



formed a separate branch. fg/PGIPl was found to share from 
50% to just over 60% identity with most monocot PGIPs. The 
identity with dicots was, however, between 35 and 47%. 

Earher evaluation of cw-regulatory elements of various 
plant pgip genes in silico detected elements responsive to light, 
wounding, salicylic acid, abscisic acid, fungal elicitors, ethyl- 
ene, and various other motifs with unknown functions (Kumar 
et al, 2009; Lu et al, 2012). The 1078 bp sequence upstream 
of the Pglpgipl ORF was analysed in silico for the identifica- 
tion of c«-elements and showed the presence of numerous 
important regulatory motifs. The consensus motif sequences 
and their respective functions retrieved from PlantCARE 
have been summarized in Supplementary Table S3 at JXB 
onhne (Supplementary Fig. S5 at JXB onhne shows the posi- 
tion of the cw-elements in the upstream sequence). The pres- 
ence of elements responsive to plant stress hormones (e.g. 



ACGT-containing abscisic acid response element, TC-rich 
repeats, TCA element, CGTCA motif, ethylene response ele- 
ments) and anoxic stress (anaerobic response element) indi- 
cates a role for PGIP in pearl millet's response to biotic and 
abiotic stress. In bean cultivar 'Pinto', PvPpgipl was found to 
be upregulated in response to wounding, oligogalacturonides, 
glucan derived from Phytophthora megasperma f sp. glycinea 
and salicyhc acid (D'Ovidio et al, 2004). A sHghtly higher 
accumulation of poplar Pdpgip4 transcripts was observed 
over that of Pdpgipl transcripts at 1 and 4h after treatment 
with salicylic acid and H2O2, respectively (Cheng et al , 2008). 
However, similar expression levels were reported for both 
pgip genes in case of methyl jasmonate treatment. In Chinese 
cabbage, pgip expression levels were increased in response to 
abiotic stresses such as water-logging, salt, cold treatment, 
and mechanical wounding (Ahsan et al, 2005). Significant 
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numbers of elements were found to be responsive to light 
(e.g. Box 1, G-Box, Specificity protein 1). The effect of light 
on transcription as well as post-transcriptional processes 
in plant growth and development has been reported previ- 
ously (Gilmartin et al., 1990). Plant defence genes have also 
been proposed to be circadian regulated, and biotic stress 
responses have been shown to be modulated by a Hght-sens- 
ing network in Arabidopsis (MulHneaux et al, 2000; Genoud 
et al, 2002; Wang et al, 2011). A plant's abihty to acclima- 
tize to changes in quality and intensity of light has been 
found to be critical in mounting an effective response against 
invading pathogens (Kulheim et al., 2002). Given the pres- 
ence of numerous light-responsive elements in Pglpgipl and 
other known pgips, and the fact that pgips are known to be 
induced by both biotic and abiotic stresses, the relationship 
between light-induced pgip expression and plant defence 
makes perfect sense. 

Production of recombinant Pg\PGIP1 and FmPGIII 

rPg/PGIPl was produced in order to assess its potential inhibi- 
tion against fungal PGs {rFmFGlll and ^kPGII). Our initial 
attempts to synthesize these proteins in eukaryotic expression 
systems such as Nicotiana benthamiana and Pichia pastoris were 
not successful (data not shown). Furthermore, Pg/PGIPl pro- 
duction in E. colt faced protein solubility and purification prob- 
lems. Hence, several solubility and affinity purification fusion 
tags were tried out for the soluble production and purification 
of Pg/PGIPl (data not shown). The N-terminal MBP and 
C-terminal Strep-tag® II combination of solubility and affin- 
ity purification tags, respectively, were found to be optimal for 
its expression in E. coll SHuffie® T7 Express (pLysSRARE2) 
and efficient purification. FmVGlW expression faced no solu- 
bility issues, and hence only a C-terminal Strep-tag® II was 
incorporated for purification purpose. Expression with pET- 
22b::MBP-IEGR-/'g/PGIPl-6xHis-Strep-tag® II in E. colt 
SHuffie® T7 Express (pLysSRARE2) and purification of the 
resulting recombinant fusion protein by coupling an anion- 
exchange column downstream of Strep-Tactin Superflow Plus 
affinity matrix resulted in a homogenous pure protein, rPglP- 
GIPl, of 79.1 kDa (Supplementary Fig. S6A at JXB online). 
The two-step purification resulted in 505- and 1 323-fold purifi- 
cation with protein recoveries of 37 and 33%, respectively. 

The single-step affinity purification of rFmPGIII and rVC 
on a Strep-Tactin Superflow Plus Cartridge resulted in rela- 
tively homogenous pure proteins of 37.9 kDa (Supplementary 
Fig. S6A) and 46.3 kDa (Supplementary Fig. S6B), respec- 
tively. Single-step purification of ri7?7PGIII resulted in 320- 
fold enrichment with a protein recovery of 72%. 

Immunoblot analysis confirmed the synthesis and success- 
ful purification of the recombinant proteins (Supplementary 
Fig. S6). In addition, tandem mass spectrometric analysis of 
trypsin-digested peptides of purified recombinant proteins 
was carried out. Subsequent correlation of the resulting spec- 
tral data using SEQUEST algorithm (Eng et al. , 2008) against 
a database containing information on theoretical digest of 
the respective fusion proteins confirmed their identity (data 
not shown). 



In vitro inhibition profiie of rPglPGiPl 

The in vitro inhibition of rPglPGlPl against ^nPGII and 
rFmPGlll was carried out in order to compare it with the 
known inhibition profile of Pv'PGIP2 against the same two 
PGs. Intact MBP-fg/PGIPl fusion protein was employed 
in the inhibition studies as protease factor Xa digestion for 
cleaving off MBP resulted in random degradation of protein 
(data not shown). rPg/PGIPl displayed a differential activity 
profile against the fungal PGs used in the present study, with 
only a partial inhibition observed against ^«PGII (Fig. 3A) 
and no inhibition against r/wPGIII. Hence, the effect of 
various parameters on AnPGll inhibition by rPglPGlPl has 
been presented below. The rVC, as expected, showed no inhi- 
bition of either enzyme. Possibly, the lack of glycosylation of 
rPg/PGIPl produced in E. coli could be a factor responsible 
for the limited inhibition observed. 

fvPGIP2 has been reported to competitively inhibit 
TtijPGI by binding at its substrate-binding site (Leckie et al. , 
1999), whereas FmVGlW, which shares 91.7% amino acid 
identity with FmVGl, evaded inhibition by the same PGIP 
(Sella et al, 2004). Although a structural explanation is lack- 
ing, an attempt was made to explain the observed effect using 
amino acid sequence alignment. The chief difference between 
the FmPGlW and FmVGl consist of one short stretch of sub- 
stitutions in the N-terminal region and five substitutions, two 
of which are close to the active site. These polymorphic sites 
may have led to subtle changes in the protein structure, thus 
preventing the accessibility of the substrate-binding site of 
TtijPGIII for interaction with /'vPGIP2 (Sella et al , 2004). As 
mentioned earlier, although sharing 37% amino acid identity, 
the global alignment score of Pg/PGIPl and /'vPGIP2 is very 
high due to the degeneracy of residues in the LRR domain. 
Hence, lack of potency of PglPGWl against 77iiPGIII is 
understandable. Assessment of the PglPGWl FmVGl inter- 
action could further be useful for a deeper comparative profil- 
ing of inhibition with i'vPGIP2. 

Effect of inhibitor concentration on AnPGII activity 

No inhibition was observed at the lowest rPg/PGIPl con- 
centration tested, but a gradual increase in inhibition was 
observed with increasing inhibitor concentration (Fig. 3A). 
A significant increase in inhibition was observed at iPglP- 
GIPl concentrations of 0.632, 1.26, and 3.16nM with inhi- 
bition being 7, 16, and 26%, respectively. However, a further 
increase of the rPg/PGIPl concentration (up to 6.32 and 
12.64 nM) resulted in marginally increased inhibition reach- 
ing a value of 28% at most. PGIP concentrations of 3.16nM 
and/or 1.26nM were used in further studies. 

Although the pattern of inhibition of both rPglPGlPl 
and PvFGlPl against the same two PGs was similar, bean 
PGIP showed complete inhibition against ^hPGII at 
concentrations lower than that used in the present study 
(Leckie et al, 1999; Sella et al, 2004). As observed in the 
present study, such partial inhibitions of various PGs have 
been reported in several PG/PGIP systems (Gomathi and 
Gnanamanickam, 2004). PGIPs from cotton showed 50 
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Fig. 3. AnPGII inhibition assay. (A) Effect of inhibitor concentration. AnPGW (5ng) was assayed with and without inhibitor (rPg/PGIPI/rVC) over a 
concentration range of 0. 316-12. 64nM and a graph with the enzyme activity over inhibitor concentration was piotted. (B) pH optimum. /^nPGii (5ng) 
was assayed with and without inhibitor at a concentration (rPg/PGIPI/rVG) of 3.16nl\/l and a graph with the enzyme activity over pH units was plotted 
to determine the pH optima of inhibition. (C) pH stabiiity. /\nPGIi (5ng) was assayed with and without inhibitor pre-incubated for 16h at pH vaiues of 
2.0-10.0 at 4 °C upon reconstitution in the assay buffer [at a concentration (rPg/PGiPI/rVC) of 3.16 nM] and a graph with the enzyme activity over 
pH units was plotted to determine the pH stabiiity of inhibitor. (D) Temperature stabiiity. /\nPGIi (5ng) was assayed with and without inhibitor pre- 
incubated for 1 h at temperatures ranging from 20 to 100 °C [at a concentration (rPg/PGIPI/rVC) of 3.16 nM[ and a graph with the enzyme activity 
over temperature was plotted to determine the temperature stabiiity of inhibitor. The data points are means of a singie experiment carried out in 
triplicates. Resuits are shown as means±standard error. Means designated with the same ietter are not significantiy different according to Tul<;ey's 
HSD test at P<0.05. 



and 25% inhibition against PGs from Aspergillus niger and 
Verticillium dahlia, respectively (James and Dubery, 2006). 
Wheat PGIP, on the other hand, showed just over 5% inhi- 
bition of PGs from A. niger and F. moniliforme (Kemp 
et al, 2003). PGIPs are very diverse in their inhibition 
specificity and potential, i.e. PGIPs isolated from a single 
plant can inhibit different PGs, the inhibitors isolated from 
different plants have been found to inhibit the same PG, 
some PGIPs have the ability to inhibit a multitude of PGs, 
and the degree of inhibition has also been found to vary 
greatly (Gomathi and Gnanamanickam, 2004). PGIPs have 
also been noted to have the potential to inhibit in vitro the 
PGs of those microbes that are non-pathogenic or to which 
they are generally not exposed, as in our case (Gomathi 
and Gnanamanickam, 2004). The presence of PGIPs with 
different inhibition potentials could be a ploy employed 
by the host ensuring the accumulation of elicitor-active 
oligogalacturonides. 



Mode ofAnPGII inhibition 

The control protein, rVC (1.26 and 3.16nM) had no effect 
on the kinetic parameters of ^77PGII (Table 1). rPg/PGIPl 
(1.26 and 3.16nM), however, was found to decrease the V^^^^ 
without affecting the i^^ of ^nPGII. Hence, the mode of 
inhibition was established to be non-competitive in nature. 
PGIPs depending on the PG are known to employ competi- 
tive, non-competitive, and mixed modes of inhibition, thus 
indicating that PGIPs are capable of recognizing different 
structural motifs in its various protein partners (Sicilia et al., 
2005). PGIPs from different sources have been shown to dis- 
play similar as well as different inhibition kinetics against the 
same enzyme. For example, BcFG from Botrytis cineria was 
competitively inhibited by pear PGIP but in a mixed mode by 
the inhibitor from bean (Abu-Goukh and Labavitch, 1983; 
SiciHa et al, 2005). However, ^mPGII was non-competitively 
inhibited by PGIPs of tomato and bean (Stotz et al. , 2000), 
as in our case. 
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Table 1. The kinetic parameters ofAnPGII withi and withiout rVC 
andrPQ\PGiP1 

AnPGW (5ng) was assayed using a substrate concentration range of 
0.025-0.25 mg ml"^ witli and witliout inliibitors (rPg/PGIPI/rVC) at 
concentrations of 1 .26 and 3.16nl\/l. The kinetic parameters were 
calculated by fitting the Michaelis-Menten equation on initial rate 
experimental data by non-linear fitting using OriginPro 7 (Originlab). 





(mg mr^) 


Vn,a. (Mkat 
mg"^ protein) 


AnPGll 


0.091 


28.5 


/\nPGII-i-rVC (1 .26 nM) 


0.091 


28.5 


/\nPGII-i-rVC{3.16nM) 


0.092 


28.6 


/InPGII+rPg/PGIPI (1.26nM) 


0.092 


24.3 


/InPGII+rPg/PGlPI (3.16nM) 


0.091 


21.3 



pH optima of AnPGll inliibition 

The ^wPGII inhibition by rPg/PGIPl (3.16nM) over a 
pH range of 3.5-5.0 showed the pH optima for inhibi- 
tion to be between 4.0 and 4.5 with 25 and 22% inhibi- 
tion, respectively (Fig. 3B). However, no inhibition was 
observed at pH 3.5 and 5.0. The pH optima for inhibi- 
tion of A. niger PG by PGIP-I and PGIP-II of guava fruit 
was determined to be 4.2, whereas that for PGIP-III was 
4.4 when assayed over a pH range of 4.0-5.5 (Deo and 
Shastri, 2003). A study involving interaction of Pv'PGIP2 
with five isoforms of A. niger PGs (AnVGl, AnVGW, 
AnVGK /1«PGB, and ^hPGC) showed PvVGlVl to 
greatly reduce the activity of three of the PG isoforms — 
^kPGI, AnVGW and yl«PGB— at all tested pH values. 
For ^«PGA and ^«PGC, PvVGlVl displayed inhibition 
only at pH 4.75 and below pH 4.2, respectively. At pH 5.0, 
however, Pv'PGIP2 was found to activate the two PG iso- 
forms (Kemp et ah, 2004). This diversity of PGIP activ- 
ity in different pH environments is crucial in countering 
the multitude of PGs encountered from various pathogens 
with different pH optima. In addition, such differential 
PGIP activity has been proposed to be involved in mount- 
ing a better host defence response through the generation 
of a steady-state concentration of biologically active oli- 
gogalacturonides (Kemp et al, 2004). 

pH and thermal stability of rPg\PGIP1 

rPg/PGIPl (at 3.16nM) was found to be stable over a wide 
range of pH values from 4.0 to 8.0 (Fig. 3C). The inhibition 
potential decreased by approximately 50% at pH 3.0 and 9.0, 
with no observed inhibition at other tested pH values. Guava 
PGIPs were reported to be stable only over a narrow pH 
range of 2.0^.0 (Deo and Shastri, 2003), but the PGIP from 
chilli retained >50% activity at pH 3.0 and 8.0 (Shivashankar 
el a/., 2010). 

rPglPGlPl (at 3.16nM) was found to be equally active 
from 20 to 50 °C (Fig. 3D). The activity dropped very slightly 
to 22% at 60 °C and significantly at 70 and 80 °C to 11 and 
3%, respectively. No inhibition was observed beyond this 



temperature. Earlier studies on thermal tolerance of the 
protein have shown it to be relatively thermostable. PGIPs 
from 'Bartlett' pear (Abu-Goukh et al., 1983) and orange 
(Barmore and Nguyen, 1985) retained significant activity at 
60 °C. The latest study on PGIP isolated from tomato was 
found to retain partial inhibitory activity, even at 100 °C 
(Schacht et al., 2011). 

In silico analysis of PglPGIPl -AnPGll and Pg\PGIP1- 
FmPGIII complexes 

In silico protein modelling, docking, and computational 
mutagenesis studies were carried out to account for the dif- 
ferential behaviour of pearl millet PGIP against ^hPGII and 
FmFGlll, and for the limited similarity in the inhibition of 
PglPGlFl and PvPGIP2 against the same two PGs. It was 
necessary to predict the underlying structural basis of inter- 
action as the two PGIPs share an amino acid identity of just 
37%. 

Homology modelling of Pg\PGIP1 and FmPGIII 

The initial homology models of PglPGIPl (Supplementary 
Fig. S7A at JXB online) and FmPGIII (Supplementary 
Fig. S7B) were generated using already known structures of 
PvPGlPl (PDB ID: lOGQ) and FmPGl (PDB ID: 1HG8) 
serving as templates, respectively. The quality of the models 
was assessed using Verify3D. Scores predicted for each resi- 
due of our constructed model were >0.1, indicating that all 
of the residues were located in favourable structural environ- 
ments. After post-refinement with the KoBa'^'^ program, the 
final models were revalidated using MolProbity. In the present 
study, the MolProbity score for both the models was >85%, 
representing a good quality model. Summarized results from 
KoBa^'^ and MolProbity are given in Supplementary Table 
S4 at JXB online. Models further energy minimized using 
GROMOS96-53a6 force field to remove any local strains were 
finally used for the docking studies. 

Docking studies of PglPGIPl -AnPGll and Pg\PGIP1- 
FmPGIII complexes 

To predict the conformation and the putative interactions 
between PglPGIPl with ^«PGII and FmPGlll, two different 
protein-protein docking programs, GRAMM-X and Rosetta 
3.4, were used. Firstly, using GRAMM-X, fast-Fourier- 
transformation-based unrestrained rigid body docking was 
performed, which generated the top 10 solutions of the com- 
plexes. In the absence of the detailed data on the binding mode 
or mutational studies on PglPGIPl, we selected the docked 
complex based on the present in vitro inhibition studies, as 
well as available experimental evidence from PvPGIP2 inter- 
actions with the two enzymes (Leckie et al, 1999; Federici 
et al., 2001; King et al, 2002; Di Matteo et al, 2003; Sella 
et al, 2004; MauHk et al, 2009; Benedetti et al, 2011). For 
further optimization, selected models from GRAMM-X were 
subjected to Rosetta 3.4. This docking algorithm searches a 
set of conformations from a given starting conformation for 
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the optimal fit between the two partners. It employs a Monte 
Carlo search followed by simultaneous optimization of side- 
chain conformations. The resulting 'decoys' obtained from 
the docking simulations were ranked using an energy function 
dominated by van der Waals interactions, an implicit solva- 
tion model and an orientation-dependent hydrogen bonding 
potential. Selection of the best docked conformation obtained 
from Rosetta 3.4 was based on: (i) docking score being an 
overall measure of the energy of the complex; (ii) an interface 
score representing the score of the complex minus the total 
score of each partner in isolation; and (iii) involvement of 
any residues in the interaction evident from the experimental 
studies from PvPGIP2. The best docked conformations of 
the PglPGWl AnPGll and PglPGlFl Fji^FGlll complexes 
are shown in Fig. 4, and the docking and interface scores are 
presented in Supplementary Table S5 at JXB online. 

From the docked protein complexes, it was suggested that 
AnFGll (Fig. 4A) and FmPGlll (Fig. 4B) both interact with 
PglPGlPl at its concave surface, but their binding orienta- 
tions differ. The residues at the (3-strand/p-turn motif of the 
central LRR domain constitute the solvent-exposed concave 
surface of the PGIP and this region determines the binding 
specificity for PGs (Kobe and Deisenhofer, 1994). LRRs 
are known to be versatile protein recognition domains pre- 
sent in over 14,000 proteins (Matsushima and Miyashita, 
2012). In case of the Pg/PGIPl-^nPGII complex, the 



A 




Fig. 4. Protein docking anaiysis. Docl<ed poses of Pg/PGiP1-/\nPGIi (A) 
and Pg/PGiPI-FmPGili (B) compiexes. Pg/PGIP1 interacts througli its 
soivent-exposed concave cavity with AnPGil and FmPGill at their N- and 
G-termini (circied in biacl<;), respectiveiy. The substrate-binding site in 
FmPGIil appears to be more exposed compared with that oiAnPGW. (This 
figure is available in colour at JXB online.) 



concave site of Pg/PGIPl interacts with the N-terminal 
site of y4nPGII, whereas in the Pg/PGIPl-f/wPGIII com- 
plex, the concave site of Pg/PGIPl interacts with the 
C-terminal site of FmFGlll. The substrate-binding site 
in Fm¥Glll appeared to be more exposed compared with 
that of ^hPGII. In terms of the docking score, binding 
of PglPGlFl with ^«PGII was predicted to be stronger 
in comparison with FmPGIII. These in silico results are 
consistent with the in vitro outcomes. Interaction of 
PvPGIP2 with ^«PGII and FinPGl showed some residues 
important for interaction with one PG to be dispensable 
for interaction with the other, suggesting that different but 
overlapping subsets of residues are vital in binding differ- 
ent ligands (Leckie et al, 1999). The molecular docking 
simulation of the 5cPGl-PvPGIP2 complex showed the 
Bl -sheet of PvVGWl to interact with the N-terminus of 
5cPGl, and the active site of BcVGl was partially buried 
by the PvPGIP2 C-terminus (Siciha et al., 2005). Analysis 
of the PvPGIP2-Fw7PGl (now referred to as i=>PG) com- 
plex showed the residues present at both the convex and 
concave side of the PGIP N-terminus to be involved in 
interaction with the loops surrounding the active site of 
the PG (Benedetti et al., 2011). These studies further dem- 
onstrated the structural flexibility and the versatility of 
PGIP binding interactions with the various PGs. 

Evaluation of protein-protein interactions in Pg\PGIP1- 
AnPGII and PglPGlPl -FmPGlll complexes by PIC 
analysis 

Energy-minimized Pg/PGIPl-y4«PGII and PglPGlPl 
FotPGIII complexes were analysed using the PIC server to 
predict all possible types of interactions at the protein-pro- 
tein interfaces, and the putative amino acid residues involved 
were mapped (Table 2). It was apparent from the data that 
the protein-protein contacts in both complexes were medi- 
ated through all types of interactions, i.e. ionic, hydropho- 
bic, and hydrogen bonds, whereas the ionic and hydrophobic 
interactions were predominating at the surface. However, 
in the docked as well as energy-minimized structures, the 
/'g/PGIPl-^77PGII complex was found to have a stronger 
binding interaction than that recorded in the PglPGlPl 
F777PGIII complex. 

Electrostatic surface charge distribution on PglPGlPl, 
AnPGII, and FmPGlll individually and the PglPGIPl- 
AnPGII and PglPGlPl -FmPGlll complexes 

To illustrate the charge distributions of molecules, a three- 
dimensional electrostatic surface potential was generated 
separately on PglPGlPl (Fig. 5A), AnPGII (Fig. 5B), and 
FmPGlll (Fig. 5C), as well as on the fg/PGIPl-^/iPGII 
(Fig. 5D) and PglPGlPl FmPGlll (Fig. 5E) complexes. 

The surface comparison of AnPGll and FmPGlll sug- 
gested that the structures not only differed with respect 
to charge distribution but also differed in shape. This dif- 
ference in the surface potential of the individual enzymes 
was also reflected in their predicted differential binding 
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Table 2. Protein interaction analysis of Pg\PGIP1-AnPGII and 
PglPGiPI-fmPGIIi compiexes using tlie PIC 

The residue pairs involved in the interacting complexes, sorted 
according to the type of interaction, are shown (the PGIP residue 
numbering followed excludes the putative signal peptide). 



Hydrophobic interactions 


In Pg/PGIP1-/lnPGII 




In Pg/PGIP1-FmPGIII 


complex 




complex 




Pg/PGIP1 


>lnPGII 


Pg/PGIP1 


FmPGIII 


F54 


Y130 


W105 


A306 


M100 


W85 


W243 


1332 


1102 


W85 


L268 


A330 


W105 


A43 






F124 


P56 






F129 


A40 






A172 


P56 






VI 75 


A36 






Side-ohain H-bonding interactions 






T28 


S234 


R153 


N266 


H79 


T64 


D222 


T332 


N145, N147 


E83 






SI 95 


E54 






Q219 


E54 






Ionic interactions 








D31 


R233 


D56 


K269 


D42 


K124 


D290 


K300 


D50 


K127 






R74 


E83, E84 






L77 


D62 






H79 


D62 






D126 


K39 






R240 


E54 







interactions with PglPGlPl. Once again, the fg/PGIPl- 
^«PGII complex was more compact than Pg/PGIPl- 
FmFGlll, with the active site cleft of FmFGlll exposed to 
a greater extent, which further explicated the in vitro results. 
This is consistent with an earlier report that electrostatic 
and van der Waals interactions play a significant role in the 
proper recognition and discrimination of PGs by PGIPs 
(Maulik era/., 2009). 

Analysis of Pg\PGIP1-Ar\PGII interaction by 
computational alanine-scanning mutagenesis 

The residues at the (3-strand/p-turn motif of PvVGIPl have 
been shown to interact with ^«PGII at the DUO a-helix, 
opposite the substrate-binding site, which fits perfectly 
with the non-competitive mode of inhibition (Stotz et al. , 
2000). To identify the hotspot residues involved in the inter- 
action of Pg/PGIPl and ^«PGII, which also follows the 
non-competitive mode of inhibition, a computative ala- 
nine mutagenesis of residues at the interface was carried 
out. Virtual scanning was performed over all interface resi- 
dues and changes in the binding free energy were calculated 
upon alanine substitution of residues at protein-protein 
interfaces. 



From the existing experimental data on PvPGIP2, the 
amino acids H104, Y105, Y107, D131, V152, F201, Q224, 
and K225 have been reported to be important in interac- 
tion with ^«PGII (Leckie et al, 1999; Casasoh et al, 2009; 
Spinelli et al, 2009; Benedetti et al, 2011). Alignment of 
PvVGlVl and Pg/PGIPl sequences identified PglPGWl resi- 
dues T99, MlOO, 1102, D126, N147, Q196, Q219, and 1220, 
respectively, at positions corresponding to the PvVGWl 
residues mentioned above. Interestingly, computational 
mutagenesis predicted six of them to be hotspot residues sig- 
nificantly involved in interaction, except for 1102 and 1220, 
with AAGbinding values of 0.61 and 0.87 kcal mol respec- 
tively (Fig. 6A). In addition, all the identified residues could 
be pinned down to the p-strand/(3-turn solvent-exposed 
region except for D31, NHS, and R240 found locaHzed at 
the N-terminus, LRR-4 and LRR-8 in the central domain 
(Fig. 1), respectively. 

The information on ^mPGII residues involved in interac- 
tion with PGIPs is very limited. Studies on conformational 
changes in ^hPGII-HG-PvPGIP2 using amide-exchange 
mass spectrometry identified four residues (E95, G104, 
DUO, and 1139) to be involved in PvPGIP2 binding (King 
et al, 2002). The residues were found to He opposite the sub- 
strate-binding site around the underside of the barrel near 
the DUO a-helix, consistent with the reported non-competi- 
tive mode of inhibition. However, the docking pose and pro- 
tein-protein interaction analysis as well as the computational 
alanine-scanning mutagenesis of the PglPGWl AnVGW 
complex (Fig. 6A, B) identified that the N-terminal region 
of ^«PGII interacts with the concave surface of Pg/PGIPl. 
Many of the identified residues were found to be locaHzed 
mainly at the small a-helix, and p-strands of p-sheet PBl at 
the N-terminus of ^«PGII away from the substrate-binding 
surface, which again is consistent with the observed non- 
competitive mode of inhibition. 

Most residues of Pg/PGIPl and ^«PGII identified by pro- 
tein-protein interaction analysis as involved in interaction 
were also identified as significantly important upon computa- 
tional alanine-scanning mutagenesis (Table 2 and Fig. 6). The 
D126 (/'g/PGIPl)-K39 (^«PGII) interaction was predicted 
by alanine mutation studies to be the most significant binding 
contact in the /'g/PGIPl-^77PGII complex. 

The above results predicted the preferential use of very 
similar motif regions in PG recognition by the two PGIPs 
with few identical amino acids. PvVGWl shares 99 and 
88% identity with PvPGIPl and Gn7PGIP3, respectively, 
at the amino acid level. However, assessment of their inhi- 
bition against ^«PGII and FmVGl, unexpectedly showed 
PvVGWl and G777PGIP3 to share similar inhibition profiles, 
but /'VPGIP2 and PvPGIPl did not (Leckie et al, 1999; 
D'Ovidio et al, 2006). Docking studies of these complexes 
proposed that not just sequence similarity but also con- 
servation of key structural features are crucial in preserv- 
ing the function mediated by appropriate protein— protein 
interactions (MauHk et al, 2009). Even though Pg/PGIPl 
and PvPGIP2 employed similar motifs for interaction with 
AnVGll at non-substrate-binding sites, they engaged differ- 
ent regions of ^nPGII. Subtle differences in the overlapping 
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Pg/PGIPl 
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Fig. 5. Electrostatic surface potential of individual proteins and protein complexes. Electrostatic potential maps of Pg/PGIP1 (A), AnPGW (B), FmPGIII 
(C), Pg/PGIP1-/lnPGII (D), and Pg/PGIPI-FmPGIII (E) complexes on which surface colours are fixed at red (-5) or blue (+5). Marl<ed regions display the 
difference in charge distributions in surface maps of the individual proteins. 
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Fig. 6. Computational alanine mutagenesis of Pg/PGIP1-/\nPGII interface residues. The plot displays the contribution of individual interacting residues 
from Pg/PGIPI (the PGIP residue numbering followed excludes the putative signal peptide) (A) and AnPGW (B) in the stability of the Pg/PGIP1-/\nPGII 
complex. Interface residues were defined as those residues with a side chain having at least one atom within a sphere with 4 A radius of an atom 
belonging to the other partner in the complex and binding hotspots defined as those residues that show AAGynding >1 kcal mol"''. 



residues as well as recruitment of additional, completely 
different amino acids could be responsible for the observed 
binding disparity. 

In silico alanine mutation studies predicted D126 
(Pg/PGIP1)-K39 (AnVGW) interaction as the single most 
important binding contact in the /'g/PGIPl-y4«PGII com- 
plex. This could be important as the D126 position is an 



evolutionarily conserved residue in PGIPs. The importance of 
a single amino acid in protein-protein interaction has been 
observed earlier. PvPGIPl gained the ability to inhibit FniPGl 
through a single amino acid substitution of K224 into the cor- 
responding amino acid of /'vPGIP2, a Q (Leckie et al, 1999). 
Hence, in silico analysis of the PglPGWl AnVGll complex 
is a good starting point for further experimental mutational 
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analysis to arrive at the actual residues involved in protein- 
protein interactions. 

In conclusion, the present study, together with earlier 
literature, suggests that the PG-PGIP interactions are 
complex, and structural and mutational analyses of vari- 
ous PG-PGIP complexes would be needed before a com- 
prehensive generalized conclusion could be drawn about 
structure-function correlation. Pearl millet is afflicted by 
many fungal and bacterial diseases. Downy mildew dis- 
ease alone accounts for annual production losses in the 
range of 20-40% (Singh, 1995). Such protein-protein 
interaction studies will be crucial from the perspective of 
generation of designer host proteins with improved com- 
bat potential against the ever-evolving pathogen virulence 
factors. 
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